Feeds to Scour
SubscribedAll
Scoured 18232 posts in 740.5 ms
A Two-Stage GPU Kernel Tuner Combining Semantic Refactoring and Search-Based Optimization
arxiv.orgยท1d
๐Ÿ•ฏ๏ธCandle ML
Preview
Report Post
The three types of LLM workloads and how to serve them
modal.comยท16hยท
Discuss: Hacker News
๐Ÿ—๏ธLLM Infrastructure
Preview
Report Post
Why AI Needs GPUs and TPUs: The Hardware Behind LLMs
blog.bytebytego.comยท2d
โšกHardware Acceleration
Preview
Report Post
Meet Z.AI 4.7 Flash, a Low-Cost Local AI Model for Coding & Smart Tasks
geeky-gadgets.comยท18h
๐Ÿ—๏ธLLM Infrastructure
Preview
Report Post
AI Systems Performance Engineering
github.comยท7hยท
Discuss: Hacker News
๐Ÿ“…Resource Scheduling
Preview
Report Post
Uncovering Unfaithful CoT in Deceptive Models
lesswrong.comยท6h
๐Ÿ›ก๏ธAI Security
Preview
Report Post
Co-optimization Approaches For Reliable and Efficient AI Acceleration (Peking University et al.)
semiengineering.comยท14h
โšกHardware Acceleration
Preview
Report Post
Artificial Intelligence
radiofreemobile.comยท1d
๐Ÿ†•New AI
Preview
Report Post
Streamlining CUB with a Single-Call API
developer.nvidia.comยท10h
๐ŸŸ๏ธArena Allocators
Preview
Report Post
Blackbox Optimization and Hyperparameter Tuning With Google's Vizier
blog.skz.devยท1d
๐Ÿ’ฐCost-Based Optimization
Preview
Report Post
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
machinelearning.apple.comยท1d
๐Ÿ•ฏ๏ธCandle ML
Preview
Report Post
Edge AI: The future of AI inference is smarter local compute
infoworld.comยท2d
๐Ÿ“ฑEdge AI Optimization
Preview
Report Post
Show HN: Multi-cloud cost visibility with latency rings and GDP data
news.ycombinator.comยท14hยท
Discuss: Hacker News
๐Ÿ—๏ธInfrastructure Economics
Preview
Report Post
CUDA Programming: From Zero to GPU Kernels
pythongiant.github.ioยท21hยท
Discuss: Hacker News
โšกHardware Acceleration
Preview
Report Post
ANN v3: 200ms p99 query latency over 100 billion vectors
turbopuffer.comยท1dยท
Discuss: Hacker News
๐Ÿ”ฎPrefetching
Preview
Report Post
Hot 24h update on the AI x Web3 stack thatโ€™s taking the space by storm:
threadreaderapp.comยท1h
๐Ÿ–ฅGPUs
Preview
Report Post
Generative AI as a Non-Convex Supply Shock: Market Bifurcation and Welfare Analysis
arxiv.orgยท1d
๐Ÿ’นPlatform Economics
Preview
Report Post
Building a Regulatory Risk Copilot with Databricks Agent Bricks (Part 1: Information Extraction)
databricks.comยท12h
๐Ÿ—๏ธLLM Infrastructure
Preview
Report Post
The Disequilibrium Advantage - Log
nibzard.comยท1d
๐Ÿ’ฐTokenomics
Preview
Report Post
Managing HWRT in Instance-Heavy Scenes
real-mrbeam.github.ioยท19hยท
Discuss: Hacker News
๐Ÿ› ๏ธBuild Optimization
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help